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Abstract. We introduce the concept of a spectral shift operator and use it 
to derive Krein's spectral shift function for pairs of self-adjoint operators. Our 
principal tools are operator- valued Herglotz functions and their logarithms. 
Applications to Krein's trace formula and to the Birman-Solomyak spectral 
averaging formula are discussed. 



1. Introduction 



Krein's spectral shift function 51 1, |52| . pM has received enormous atten- 

Dplications in a variety of fields including 



tion in the past due to its widespread a 

scattering theory (§, Ch. 19, @, & |§, @, @, @, §|, @, @, Ch 



relative index theory (M , UM , |15], fl38|, [{59|, Ch. X), spectral averaging ( 
jyj, 127ft , p2[ , |66| , |79[]-[{Slf) and its application to localization properties 

of random Hamiltonians ([0, |f(J, ||, ||, Ch. VIII, @, @, |1, 
Ch. V, |78|, p9f), eigenvalue counting functions and spectral asymptotics ( [ 10 1 , 
§§, || , (3fj, ^-@), semi-classical approximation @) and trace 

formulas for one-dimensional Schrodinger and Jacobi operators (|3l| [^3|, |j35f , 
p9| ). Detailed reviews on the spectral shift function and its applications up to 
1993 were published by Birman and Yafaev [|l2), jL3|. These two papers contain 
extensive bibliographies and the interested reader will find many additional refer- 
ences therein. Historically, the concept of a spectral shift function was introduced 
by I. M. Lifshits |§, |7j. 

Our main contribution to this circle of ideas is the introduction of a spectral 
shift operator S(A, Ho,H) for a.e. A £ I, associated with a pair of self-adjoint 
operators H , H = H a + V with V G Bi(H) (H a complex separable Hilbert 
space). In the special cases of sign-definite perturbations V > and V < 0, 
E(\,H ,H) turns out to be a trace class operator in H, whose trace coincides 
with Krein's spectral shift function £(A, Ho,H) for the pair (Hq,H). While the 
special case V > has previously been studied by Carey |l8| , our aim here is to 
treat the case of general interactions V by separately introducing the positive and 
negative parts V± — (\V\ ± V)/2 of V. In general, if V is not sign-definite, then 
S(A, Ho, H) is not necessarily of trace class. However, we will introduce trace class 
operators S±(A) naturally associated with V±, acting in distinct Hilbert spaces 
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H±, such that 

£(A, H ,H) = tr w+ (3+ (A)) - tr w _ (S_(A)) for a.c. A e R. (1.1) 

(An alternative approach to £(A, Hq, H), which does not rely on separately intro- 
ducing V+ and V-, will be discussed elsewhere [j36|.) 

Our main techniques are based on operator-valued Herglotz functions (con- 
tinuing some of our recent investigations in this area pOfl ) and espe- 
cially, on a detailed study of logarithms of Herglotz operators in Section ||. In 
Section || we introduce the spectral shift operator S(A, Hq, H) associated with the 
pair (Ho,H) and relate it to Krein's spectral shift function £(A, Hq, H) and his 
celebrated trace formula ]5lJ . Finally, Section |] provides an application to spectral 
averaging originally due to Birman and Solomyak |p"lf and hints at operator- valued 
generalizations thereof. A number of additional applications of this formalism will 
appear elsewhere f36f . 

2. Logarithms of Operator- Valued Herglotz Functions 

The principal purpose of this section is to study operator-valued Herglotz 
functions and particularly their logarithms and associated representation theo- 
rems. In this manner we shall obtain operator-valued generalizations of some of 
the classical results on exponential Herglotz representations studied by Aronszajn 
and Donoghue j|. 

In the following Ti denotes a complex separable Hilbert space with scalar 
product (■, -)u (linear in the second factor) and norm || • ||^, I-h the identity 
operator in Ti, B{TL) the Banach space of bounded linear operators defined on 
H., BpjH), p > 1 the standard Schatten-von Neumann ideals of B(H) (cf., e.g., 
jyj, J77J) and C + (resp., C_) the open complex upper (resp., lower) half-plane. 
Moreover, real and imaginary parts of a bounded operator T E B(H) are defined 
as usual by Re(T) = (T + T*)/2, Im(T) = (T — T*)/(2i). 

Definition 2.1. M : C+ — * B(7i) is called an operator-valued Herglotz function 
(in short, a Herglotz operator) if M is analytic on C+ and Im(M(z)) > for all 

z e C+. 

Theorem 2.2. (Birman and Entina ||, de Branges @, Naboko @-j62j.) Let 

M : — » B(H.) be a Herglotz operator. 

(i) Then there exist bounded self-adjoint operators A = A* E B{TL), < B G B{TL), 
a Hilbert space K, D Ti, a self-adjoint operator L — L* in K,, a bounded nonnegative 
operator < R £ B(IC) with R\>cen = such that 

M(z) = A + Bz + R 1/2 {I K + zL){L - z)~ l R l/,2 \n (2.1a) 
= A + {B + R\ H )z + (1 + z 2 )R x ' 2 {L - z)- 1 ^ 2 ^. (2.1b) 

(ii) Letp>\. Then M(z) e B P (H) for all z E C+ if and only if M{z ) E B P (H) 
for some zq E C+. In this case necessarily A, B, R E B p (Tt). 
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(iii) Let M(z) G B\{TL) for some (and hence for all) z G C+. Then M(z) has 
normal boundary values M(X + iO) for (Lebesgue) a.e. A G R in every B p (TL)-norm, 
p > 1. Moreover, let {£x(A)}agR be the family of orthogonal spectral projections of 
L in K,. Then R 1, ' 2 El(X)R 1 ' 2 is Bi(7i) - differ entiable for a.e. A G R and denoting 
the derivative by d(R 1 / 2 EL(X)R 1 / 2 )/d\, lm(M(z)) has normal boundary values 
Im(M(A + iO)) for a.e. A G R in B\{TL)-norm given by 

lim || 7 r- 1 Im(Af (A + is)) - d{R 1/2 E L {\)R 1/2 \ n ) / 'd\\\ Bl{n} =0 a.e. (2.2) 



Actually, the normal boundary values M (A + iO) in Theorem |2.2| (iii) can 
be replaced by nontangential ones. Since this distinction will play no role in the 
remainder of this paper we omit the corresponding details. 



Theorem 2.2 (i) follows by considering the quadratic form (tp, M(z)ip)-n, re- 
sulting in a scalar Herglotz function, in combination with Naimark's dilation the- 
orem (cf., e.g., Theorem 1 in Appendix I of |lq]). Details can be found in pc[ |. For 
Theorem (ii), (iii) we refer to pi"] , | ]62| . In particular, p cannot be chosen equal 
to 1 in Theorem^ (iii) (cf. Q). Moreover, if M(z a ) G B P (H) for some z G C+ 
and some p > 1, then M(z) need not even have boundary values M(X + iO) in the 
weak topology of H for a.e. A G R. In fact, the quadratic form (/, M(A + ie)f)u 
may converge for / in a fixed set V C H (independent of A) to (/, M(A + i0)f) n 
for a.e. A G K as e [ 0, however, M(A + «0) may be a densely defined unbounded 
operator in H for a.e. A G R (cf. |]6l|). 

Originally, the existence of normal limits M(A + iO) for a.e. A G R in B2CH)- 
norm, in the special case A = 0, B = —R\n: assuming M(z) G Bi(H), was proved 



by de Branges [|26| in 1962. (The more general case in (2.1) can easily be reduced to 
this special case.) In his paper ^6|, de Branges also proved the existence of normal 
limits Im(M(A+zO)) for a.e. A G R in Si(W)-norm and obtained ( |2.2| ). These results 
and their implications on stationary scattering theory were subsequently studied 
in detail by Birman and Entina @ , § • (Textbook representations of this material 
can also be found in ||, Ch. 3.) 

Invoking the family of orthogonal spectral projections {-El(A)}a<er of L, ( |2.l| ) 
then yields the familiar representation 

M(z) = A + Bz+ f (1 + X 2 )d{R 1 ' 2 E L (\)R 1 ' 2 \ H ){{\ - z)' 1 - A(l + A 2 )" 1 ), 

JR 

(2.3) 

where for our purpose it suffices to interpret the integral in (|2.3|) in the weak sense. 



Further results on representations of the type ( |2.5| ) can be found in [|16|, Sect. 1.4 
and @. 

Before we continue our investigations on Herglotz operators a few comments 
concerning our terminology are perhaps in order. The representation ( [2.3] ) in the 
special case of scalar Herglotz functions is due to Riesz and Herglotz, respectively, 



Nevanlinna. The former authors discussed the analog of (2.3) in the open unit 



disk D C C , wh ereas the latter studied (2.3) in C+. As a consequence, functions 



of the type (2J3) are frequently called Herglotz or Nevanlinna functions. Moreover, 
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researchers in the former Soviet Union coined the term i?-functions. Here we follow 
the traditional terminology in mathematical physics which seems to prefer the 
notion of Herglotz functions. 

Since we are interested in logarithms of Herglotz operators, questions of their 
invertibility naturally arise. The following result clarifies the situation. 

Lemma 2.3. Suppose M is a Herglotz operator with values in B{TL). IfM(zo)^ 1 G 
B(H) for some z G C+ then Af(z)" 1 G B{H) for all z G C+. 

Proof. Suppose there is a z\ G C+ and a sequence {e„}„ e N C H such that | \e n \ \n = 
1, n G N but \iva. n ^ <x \\M{zx)e n \\u = 0. Then lim„^ oc (e n , lm(M(zi))e n ) n = 
lim^oo ||(Im(M(2;i))) 1 / 2 e„|| w = and hence 

lim ||Im(M(z 1 ))e„|| w = 0. (2.4) 



By ( p.laQ , 

Im(M(z)) = lm(z)(B + R 1 ' 2 ^ + L 2 )((L - Rc(z)) 2 + (Im(z)) 2 )- 1 R 1/2 \ n ) 

= lm(z){B + R 1/2 C(z)R 1/2 \ H ), (2.5) 

where C(z) = (I yc + L 2 K(L-Re(z)) 2 + (Im(z)) 2 )- 1 > is invertible in JC, C(z)- 1 G 
B(K), z G C+. Thus, @ implies lim„^ 00 ((e„, Be„) H + OR 1 / 2 ^, c{z 1 )R 1 / 2 \ H 
e n )n) = and hence lim„_ ) . 00 \\B l / 2 e n \\ n = 0, and lim^oo ||i?e„|| w = 0, lim^oo 
||-R 1 ^ 2 |w e ri||w = 0. Consequently, 

M( Zl )e n = Ae n + z x Be n + {R 1/2 (I K + Zl L)(L - z^R 1 ' 2 ^ (2.6) 

with i? 1 / 2 (/ K + z x L){L - zx)- 1 G B(/C) yields lim JWOO ||Ae n || H = 0. Applying 
(2.1a) again we infer linin^oo ||M(z)e„||-H = for all z G C+ contradicting our 
hypothesis M(zo)^ 1 e B(H). This argument shows two facts. First, by choosing 
e n = e G ker(M(zi)) it yields ker(M(z)) = {0} for all z G C+, and second, it 
proves the boundedness of M(z) _1 : ran(M(z)) — > H, z G C+. In particular, one 
infers that ran(Af(z)) = dom(A/(z) _1 ) is a closed subspace of 7i. By exactly the 
same argument one derives {0} = ker(M(z)*) = (ran(M(z)) , z G C+ and thus, 
M(z)" 1 G for all z G C+. □ 



Lemma 2.3 admits the following generalization in connection with paramet- 
rices, that is, generalized inverses, familar from treatments of the Calkin alge- 
bra. In the following we denote by J-(Tt) the algebra of finite-rank operators 
on H and recall that T is called a left (resp., right) parametrix of M(zq) if 
(TM{z ) - In) G T(H) (resp., (M(z )T - J w ) G ^(W)). 

Lemma 2.4. Suppose M is a Herglotz operator with values in B(Ti.). If M(zo) 
/ias a Ze/t (resp., right) parametrix for some zq G C+, i/ien M(z) /ias a Ze/t and 
right parametrix for all z G C+ . 



Proof. From the proof of Lemma |2.3| one infers 

ker(M(zi)) = ker(M (z 2 )) for all Zi,z 2 G C+, (2.7a) 
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ker(M(z)*) = ker(M (z)) = (ran(M (z))) x , z G C+ (2.7b) 
and hence M(z) can be decomposed as 

M{z) = ( M Q (z) °X H = H® kBr(Af (*)), (2.8) 

where 



ker(M(z)) = {0}, ran(M(z)) = W, z e C+. (2.9) 

From TM(z ) - In = F G F{H) one concludes ker(M(z )) C ker(7 w + F) and 
hence 

dim(ker(Af(z))) < oo for all z G C+. (2.10) 

Denote by P the orthogonal projection onto H, H = PH then PTM(zq)P = 
PTPM(zo) = P{In + F)P = I n + F. In order to prove that M(z )~ G B(H) we 
argue as follows: Suppose there is a sequence {e„} ne N C Ti, ||e„||^ = 1 such that 

km \\M(z )e n \\ n = 0. (2.11) 

n— >oo 

Hence lim^oo ||PTPM(z )e n ||^ = lim^oo ||(I^ + -F)e„||^ = 0. Next, let £Li be 
the spectral projection onto the geometric eigenspace of F corresponding to the 
eigenvalue —1, and 

E^i+Etx = I n (2.12) 

(here + denotes the direct sum but not necessarily the orthogonal direct sum ©). 
Then 

lim \\(I fl + F)E± l e n \\ n = Q. (2.13) 

n— *oo 

Since + F is boundedly invertible on E_]?t, one infers 

lim H^enll^ = 0. (2.14) 

n— »oo 

Introducing e n = E-\e n , n E N, one concludes from ( pTTl| ), (pl^)-(p^) that 
lim, woo ||e„||^ = 1, lim„_» 00 ||M(z )e n ||^ = 0. Since {e„}„ GN is compact, there 
exists a subsequence {e„ fc } feeN C {e n } n j^ such that limfc_» 00 ||e„ fc -e \\ n = 0, e G 
ran(i?_i), ||eo||^ = 1. Hence limfc^oo ||Af(zo)e nf! ||^ = implying the contradiction 
M(z )i = 0. Thus, Mizo)- 1 G B(H) and hence M(z)- 1 G 6(H), z G C+ by 
Lemma 2^. In particular, ran(Af(z)) = TL, z 6 C+ and hence Af(z) = ( M W 
has left and right parametrices taking into account (2.10). □ 



Concerning boundary values at the real axis we also recall 
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Lemma 2.5. (Naboko JS3|.) Suppose (M — In) is a Herglotz operator with values 
in B\(TL). Then the boundary values M(X + iO) exist for a.e. A G M in B p {TL)-norm, 
p > 1 and M(A + iO) is a Fredholm operator for a.e. A 6 1 with index zero a.e., 

ind(A/(A + iO)) = for a.e. A G M. (2.15) 

Moreover, 

ker(M(A + iO)) = ker(M(i)) = (ran(Af(A + iO)))" 1 /or a.e. A G K. (2.16) 
In addition, if M(zq)^ 1 G B(Ti) for some (and hence for all) zq G C+, i/ien 

M(A + iO)" 1 G /or a.e. A G K. (2.17) 



Generalizations and a counter example to ( 2.17 ) if (M — In) G Bi(H) is 
replaced by (M — In) G B P (H), p > 1 can be found in J63|. 
Next, let T be a bounded dissipative operator, that is, 

T G Im(T) > 0. (2.18) 

In order to define the logarithm of T we use the integral representation 

/>oo 

\og(z) = -i dX^z + iX)- 1 - (1 + iX)- 1 ), z^-iy,y>0, (2.19) 



with a cut along the negative imaginary z-axis. We use the symbol log(-) in (|2.19|) 
in order to distinguish it from the integral representation 
,o 

ln(z)=/ dX(iX-z)- 1 - A(l + A 2 )" 1 ), *GC\(-oo,0] (2.20) 



with a cut along the negative real axis. Both representations will be used later 
and it is easily verified that log(-) and ln(-) coincide for z G C+. In particular, one 
verifies that ( [2.19 ) and ( 2.20 ) are Herglotz functions, that is, they are analytic in 
C + and 

< Im(log(z)), Im(ln(z)) < n, zE C+. (2.21) 
Lemma 2.6. Suppose T E B(TL) is dissipative and T -1 G B(TL). Define 

/>oo 

\og(T) = -i dX^T + iX)- 1 - (l + iX)- 1 ^) (2.22) 



in the sense of a B(Ti)-norm convergent Riemann integral. Then 

(i) log(T) G B{U). 

(ii) IJT = zl n , z E C+, then log(T) = \og{z)I H - 

(Hi) Suppose {P n }neN C B(TL) is a family of orthogonal finite-rank projections in 
TL with s-limn^oo P n = 1-n- Then 

s-lim((J„ - P n ) + P n TP n ) = T 

n—>-oo 

and 

s-limlog((7 w - P n ) + P n {T + ie)P n ) 
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s-limP„(log(P„(P + ie)P n \ PnH )Pn = log(T + ieI H ), e > 0. 



(iv) n-lim ei0 log(P + ieI H ) = log(T). 

fa) e log(T) = T 

Proof, (i) Clearly log(P) e B{H) since 

||log(T)||< / 5 dA(||(T + *A)- 1 || + l) + 



dA(||T|| + l)A- 



(2.23) 



< 2 



using \\{T + iA)- 1 !! < A -1 , A > 0. Moreover, by ||(T + ^ 1 1 1 < |jr- 1 ||(l — 
IT^HA)- 1 for < A < [IT 7 - 1 !!- 1 , choosing 5 = 2 1 1 1 1 1 1 is sufficient to 
bound the first integral in ( 2.23| ). 

(ii) is obvious from (2.19). 

(iii) For any / G H, e > 0, 

dX {{{I H - P„) + P„(P + ie)P n + zA)- 1 - (P + is + zA)" 1 )/ 

"^((e + A)- 1 )!!/!! 

dX - P„) + P„PP„ - P + ie(P„ - 7„))(T + te + zA)- 1 /^- 1 
dA X- 2 \\{{I n - P n ) + P„(T + ie)P n - T - ieI H )\\ ||/||. (2.24) 

N 

Taking 5 > sufficiently small and N sufficiently large such that the first and 
third integrals in ( |2.24 ) are sufficiently small uniformly with respect to n £ N for 
fixed / G H, it suffices to let n — * oo in the second integral in (2.24) for fixed 6 
and N. 

(iv) One estimates, 

||log(P + fe)-log(P)|| = 



< 



dX {{T + ie + iX)- 1 - (P + iX)- 1 ) 

dx (HP^lKi - (e + A)!^- 1 !!)- 1 + iit-^ki - AHP- 1 !!)- 1 ) 

J™ dXX- 2 < 2<S(||P- I |r 1 - (e + S))- 1 + sS- 1 



(2.25) 



for e + 6 < HP" 1 !!- 1 . Taking 5 < 2- 1 ||p- 1 ||- 1 , e < 6/2 and e j then yields the 
result since S > can be chosen arbitrarily small. 

(v) clearly holds for dissipative n x n matrices and hence 

Jog((I n -P n ) + P n (T+ie)P n ) 



{I H -Pn)+Pn{T + ie)P ni £>0 



(2.26) 



upon decomposing TL = P n H © {In — Pn)W, where P n are orthogonal rank-n pro- 
jections. Since s-linin^oo e An = e A whenever A n: A G B{H.) and s-linin^oo A n = A 



(simply use e B = E^B ra /(m!), B G B{H) and A™ - A m = A*(A 
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A)A m ~ 1 - i ), one infers from (iii) and fl2.26|) e log ( T+l£ ) = T + ie, e > 0. Together 



with (iv) this yields (v). □ 

Lemma 2.7. Suppose T G B(TL) is dissipative and T^ 1 S B{Ti). Let L be the 
minimal self-adjoint dilation of T in the Hilbert space K,^TL. Then 

Im(log(T)) =icP n E L ((-oc,0))\ n , (2.27) 

where Pn is the orthogonal projection in K, onto Ti. and {El (A) }agR * s the family 
of orthogonal spectral projections of L in /C. In particular, 

< Im(log(T)) < nl n . (2.28) 

Proof. By Sz.-Nagy's dilation theorem (see the corresponding result in [{36) , Ch. Ill, 
Sect. 2, Theorem 2.1 for contractions), one infers 

(T + iX)- 1 = P n {L + i\Y l \n, A>0, (2.29) 
where L is the minim al self-adjoint dilation of T in /C. Then the existence of 



T- 1 e B(H) and ( p.29|) yield 

£l({0}) = 0. (2.30) 



In order to prove ( 2.30 ) one can argue as follows. Consider the contraction S — 
(T — i)(T + i) _1 . According to Theorem 3.2, Ch. I, Sect. 3 in [|6|, every contraction 
on the Hilbert space TL corresponds to a decomposition of Ti. into an orthogonal 
sum H = Hq © Hi of two reducing subspaces of S such that the part of S on 
Tlo is unitary, and the part of S on Tii is completely non-unitary. Moreover, this 
decomposition is unique. (We recall that a contraction is called completely non- 
unitary if there are no non-zero subspaces reducing this contraction to a unitary 
operator.) Since T is invertible, the unitary part of the contraction S does not 
have the eigenvalue —1. Since the minimal unitary dilation of a completely non- 
unitary contraction has absolutely continuous spectrum (see J85|), we conclude 
that the minimal unitary dilation of S does not have the eigenvalue — 1 and hence 
the kernel of the minimal self-adjoint dilation L of the dissipative operator T is 
trivial, proving Q2.30| ). Next, ( |2.22p implies 



/"GO 

Im(log(T)) = - / dXRc((T + iX)- 1 - (l + iX)- 1 ^) 
Jo 

= -/ dXRe{{T + iX)- 1 - (l + iX)- 1 ^) 
Jo 

dXRc((L + zA)- 1 - (l + iA)- 1 ^)^ 

= -/ dXRe((T + iX)- 1 - (l + iX)- 1 ^) 
Jo 

/oo 
dX(L(L 2 + A 2 )- 1 -(l + A 2 )- 1 ^)^, £>0. (2.31) 
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Let us prove that 



/oo 
d\L(L 2 + A 2 )" 1 = (7r/2)(P z ,+ - P L ,-), 



(2.32) 



where Pl,+ = £x((0, oo)) and Pl- = J5i((— oo,0)) are the spe ctral projections 
of _L corresponding to t he half -lines (0, oo) and (— oo,0). Then ( 2.27 ) and hence 
( [2.28D follow from ( |2.3lD , (|||) and the fact that 



lim 

ej.0 



/ d\Rc((T + iX)- 1 - (l + iX)- 1 ^] 
Jo 



= 0. 



(2.33) 



Using the estimate | dX^i(p, 2 + A 2 ) -1 < (ir/2), fi G R, e > by the spectral 
theorem for the self-adjoint operator L, we infer that the family of operators 
f dXL(L 2 + A 2 ) -1 is uniformly bounded in e > and therefore, it suffices to 
check the convergence ( |2.32| ) on a dense set in K. A natural candidate for this set 
is V = {{J s>Q E L (R\(-6,6))f\f G K}, which is dense in K since by ( p3o| ) the 
kernel of L is trivial. For / G V we have 



/oo />oo y> 

dAL(i 2 + A 2 )- 1 /= dX 
Je Jr 



^ 2 + X 2 )- 1 dE L {n)f 



R\(-8,6) 

((%/2)-atctim(e/ii))dE L (ti)f- [ ((tt/2) - arctan(e/|A«|))dSz(A*)/ 



^/2){E L {{&, go)) -^((-oo, -*)))/ 

-5 



arctan(e//x)c£Ex(/z)/ 



arctan(e/|^|)<i£ , i(yLt)/ 



(2.34) 



for all <5 > 0, <5 = <$(/) small enough. For fixed / G 2?, going to the limit £ — ► in 
( ^.34D , we get 

p OO 

lim / dXL(L 2 + A 2 )" 1 / = (tt/2)(E l ((6,oo)) - E L ((-oo,6)))f 

£ i° Je 

= {*/2)(P L ,+ -P Lt -)f, (2.35) 



proving ( 2.32 ). 



□ 



Combining Lemmas 2.3 and 2.7 one can prove the following result. 

Lemma 2.8. Suppose M : C+ — > B(Tl) is a Herglotz operator and M(zo) 1 G 
B{TL) for some {and hence for all) zq G C+. Then log(M) : C+ — > B(TL) is a 
Herglotz operator and 

< Im(log(M(z))) < tcI h , z G C+. (2.36) 

Proof. Clearly 



log(M(z)) 



dX((M(z)+iX)- 1 - {l + iX)- 1 ^), ze^ 



(2.37) 



is analytic for z G C+ since M(z) 1 G B(7 i) for all z G C+ by Lemma |2.3[ An 
application of Lemma 2/7 then yields ( 2.36 ). □ 
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Thus applying @.la| ) to log(M(z)) one infers 

log(M(») = C + Dz + R 1/2 {I ic + zL){L~z)- 1 R 1/2 \ H , z £ C+ (2.38) 

for some Hilbert space K, D H, some bounded self-adjoint operators C, < D E 
B(H), a bounded nonnegative operator < i? E £>(7i) with R\^Q n = 0, and a 

self-adjoint operator L = L* in /C. By comparison with scalar Herglotz functions 
one would expect that D = 0. That this is indeed the case is proved next. 



Lemma 2.9. D = in the representation (2.38) for log(M(z)), z £ C+. 
Proof. Consider z — iy, y f oo. Then 

H2T 1 iog(M(iy)) - = Oiy- 1 ) (2.39) 

and by ( g3| ), 

^y' 1 In >y- x Tm{\og(M(iy))) 

= D + y- 1 Im(fi 1 /2( / _ + _ iy)-l#/2| w ) 

= £> + j/ _1 (... > 0...) > D > (2.40) 
and hence D = since y _1 can be chosen arbitrarily small. □ 

Introducing the family of orthogonal spectral projections {-E £ (A)}asR of L 
in K. one can thus rewrite ( |2. 38 ) as 

log(M(z)) = C 

+ f (1 + A 2 )d(i? 1 / 2 £; £ (A)i? 1 / 2 | w )((A - z)- 1 - A(l + A 2 )" 1 ), z E C+ (2.41) 

JR 

and hence 

Im(log(M(z))) = Im(z) / rf(i? 1/2 ^ £ (A)i? 1 / 2 | w )(l + A 2 )|A - z|~ 2 (2.42) 



for z E C+, interpreting both integrals in (2.41) and ( |2.42 ) in the weak sense for 
simplicity. Again by comparison with scalar Herglotz functions one expects that 
d(R 1 ^ 2 Ej j (X) R}' 2 \n) is a purely absolutely continuous operator- valued measure 
on R. This is confirmed by the following result (tr^(-) denotes the trace of trace 
class operators in rt). 

Theorem 2.10. Suppose M : C+ — ► B(TL) is a Herglotz operator and M(zo) -1 E 
B(TL) for some (and hence for all) zq E C+. Then there exists a family of bounded 
self-adjoint weakly (Lebesgue) measurable operators {5(A)}agr C B(H), 

< 3(A) < In for a.e. A E R (2.43) 

such that 

log(M(z)) = C+ /dAS(A)((A-z)- 1 -A(l + A 2 )" 1 ), z E C+ (2.44) 
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the integral taken in the weak sense, where C — C* G B{TL). Moreover, i/Im(log 
(M(zo))) G Bi(H) for some (and hence for all) zq G C+, then 

< E(A) G Bx{H) for a.e. A G K, (2.45) 

0<tr w (S(-))eli c (R;dA), / dA (1 + A 2 )" 1 tr w (5(A)) < oo, (2.46) 

and 

tr w (Im(log(M(z)))) =Im(z) /dA tr w (S(A))|A-z|- 2 , z G C + . (2.47) 



Proof. Let f £ H and denote 

dw/(A) = (l + A 2 )d(i? 1 / 2 /, J B £ (A)i? 1 / 2 /)« = (l + A 2 )rf||S £ (A)i? 1 / 2 /|| 2 ,. (2.48) 
Then 

0< (/,Im(log(M(z)))/)„ <n\\f\\ 2 H: z G C + , /e« (2.49) 

proves that dw/ is purely absolutely continuous, du>f — dujf iac by standard argu- 
ments (see, e.g., j|). Thus, 

duj f (X) = £/(A)dA with < C/(A) < H/H 2 ^ for all / G H and a.e. A G R. 

(2.50) 

By ( |2. 48] ), £/(A) defines a quadratic form 

£/(A) = (/, H(A)/) W for some < 3(A) < J w and a.e. A G K (2.51) 



proving (2.43) and (2.44). The representation ( 2.38)) (with D = 0) implies 



Im(log(A/(z)) = i? 1 / 2 ^ + L 2 )((L - Re(z)) 2 + (Im^)) 2 )" 1 ^/ 2 ^ (2.52) 

and hence Im(log(M(zo))) G B\{TL) for some z G C + implies Im(log(M(z))) G 
Bi(TC) for all z G C+. In particular, 

Im(log(M(i))) = f dA(l + A 2 )~ 1 S(A) (2.53) 

then proves 

tr w (Im(log(M(i)))) = f d\ (1 + A 2 )" 1 trw(3(A)) < oo (2.54) 



by the monotone convergence theorem. Hence < tr-^(S(A)) = ||c(A)||i < oo for 
a.e. Ael completes the proof. □ 

Remark 2.11. For simplicity we focused on dissipative operators thus far. Later 
we will also encounter operators S G B(TL) with — S dissipative, that is, Im(S') < 0. 
In this case S* is dissipative and one can simply define log(S) by 

log(S) = (log(S*))% (2.55) 
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with log(5*) defined as in ( 2.22 ). Moreover, 

log(M(z)) = 6- [ dX §(A)((A - 2)- 1 - A(l + A 2 )" 1 ), z e C+, 
J& 

6 = 6* £ B(H) and < §(A) < I n for a.e. A £ K, 
whenever M is analytic in C+ and \m(M(z)) < 0, z 6 C+. 



(2.56) 
(2.57) 



Remark 2.12. Theorem 2.10 represents the operator-valued generalization of the 



exponential Herglotz representation for scalar Herglotz functions studie d in detail 
by Aronszajn and Donoghue [Q (see also Carey and Pepe Jl9|). Theorem 2.10 is not 
the first attempt in this direction. Carey ijTsI , in 1976, considered the case M(z) — 
I H +K*{H -z)- l K (i.e., A-R^LR^ln = I n , B = 0, (1 + L 2 ) 1/2 R 1/2 \h = K 
when compared to ( |2.1a| )) and established 

M(z) = exp( / d\B(X)(\ - z)~ 



(2.58) 

for a summable operator function B(X), < B(X) < In (i.e., the analog of S(A) in 
( 2.44 )). Although Carey's proof also uses Naimark's dilation theorem as described 
in Theorem 1 of Appendix I of fl6[| , it is different from ours and does not utilize 
the integral representation ( 2. 22] ) for logarithms. 



Remark 2.13. At first glance it may seem that we have been a bit pedantic in 
introducing various branches of the logarithm in ( 2. IE ) and ( [2.20 ). Actually, these 
branches are just a special case of the following family of branches 



iogaO) 



(2.59) 



where 7 Q denotes the ray, 7„ = {( e C + | ( = re la , < r < oo, a € [n, 2n]}. In 
particular, log(-) = log 37r / 2 (-) and since J ^ dX ((1 — A) -1 — A(l + A 2 ) -1 ) = 0, one 
infers ln(-) = log 7r (-). That some care has to be taken in connection with a consis- 
tent choice of branches especially fo r ope rator-valued branches of the logarithm is 
illustrated in the following Remark 2.14 and in Remark 3.3. 



Remark 2.14. In view of our applications in Sections 3 and 4 it seems worthwhile 
to re call in connection with our hypothesis Im(log(M(zo))) £ Bi(Tt) in Theorem 
2~10| that if A e Bi(H) and det w (7 w + A) ^ 0, then \og(I H + A) e Bi(H) by 



(|2.22D (using (I n + A + iX)- 1 - (1 + iX^In = (1 + X)~ l (I n + A + iXj^A.) 



Moreover, detain + A) = IlneN^ + ^n(A)), where X n (A) denote the eigenvalues 
of A repeated according to their algebraic multiplicity, then shows 

tr w (log(/ w + Aj) = log(det H (/ w + A)). (2.60) 

In fact, ( [2.60 ) holds for any branch log Q (-) and hence, in particular, for the branch 
ln(-) on either side of ( 2. 60] ). (Here det-ft(-) denotes the Fredholm determinant for 
operators in TL.) 
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3. The Spectral Shift Operator 
The main purpose of this section is to introduce the concept of a spectral shift 



operator (cf. Definition 3.5) and a new approach to Krein's basic trace formula 
& 

Let TL be a complex separable Hilbert space and assume the following hy- 
pothesis for the remainder of this section. 

Hypothesis 3.1. Let Hq be a self-adjoint operator in TL with domain dom(i/o), 
J a bounded self-adjoint operator with J 2 = In, and K € 62 a Hilbert- Schmidt 
operator. 

Introducing 

V = KJK* (3.1) 

we define the self-adjoint operator 

H = H + V, dom(H) = dom(iJ ) (3.2) 

in TL. 



Given Hypothesis 3.1 we decompose TL and J according to 

7+ 
-I 



J=[ I i _° T ) , TL = TL+®H-, (3.3) 



j+ = r£ 0), j- = r j j, j=j+-j-, (3.4) 

with I± the identity operator in TL±. Moreover, we introduce the following bounded 
operators 

<f>(z) = J + K*(H - z)~ x K : TL -> TL, (3.5) 

$+(z) = 1+ + J+K*(H - z)- x K\ u+ : TL+ -» TL+, (3.6) 

= I- - J-K*(H+ - z)- 1 K\ n _ : TL- -> H-, (3.7) 

for z s C\R, where 

V+ = KJ+K*, (3.8) 

H + =H + V+, dom(iJ+) = dom(# ). (3.9) 



Lemma 3.2. Assume Hypothesis 3.1. Then $, $ + , and — $_ are Herglotz oper- 



ators in TL, TL+, and TL-, respectively. In addition (z G C\M), 

^(z)- 1 = J - JK*(H - zy x KJ, (3.10) 

M*)" 1 = 7 + - J+K*(H+ - ^KW^ (3.11) 

=/_ + J_K*(H - z)- 1 K\ n _. (3.12) 
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Proof. It suffices to consider $ + . Since I + and Hq are self-adjoint, &+(z) = (J4 
J + K*(H — z)~ 1 KJ + )\ u + is clearly analytic in C\K and satisfies Im($ + (z)) > 



for z G C+. Relation (3.11) is then an elementary consequence of the second 
resolvent equation, 

(H+ - z)- 1 = {H - z)- 1 - (H - z)- 1 V+{H+ - z)- 1 (3.13a) 

= {Ho - z)- 1 - (H+ - z^V+^Ho - z)-\ (3.13b) 

the fact ,l\ = J + , and simply follows by multiplying the right-hand sides of (|3.6|) 



and (3.11) in either order. □ 



Remark 3.3. In the following we need to make use of the formula 

dtv,c(log{I K + F(z)))/dz = tx K (F'(z){I K + F{z))- 1 ). (3.14) 

This result is proven, for instance, in ^8|, Ch. I, Sect. 6.11 or jy], Sect. IV. 1 for 
analytic F(-) G Bi(/C) in some region !lcC such that (Jk_+ ^(O) -1 G B(K) in 9, 
using the standard branch ln(-). In this case equation ( |3.14 ) is first proven in the 



finite-dimensional case, followed by taking the limit n — > 00 upon replacing F(z) 
by P n F(z)P n , with P n a family of orthogonal projections in K, strongly converging 
to I/c as n — > 00 (cf., p^j , p. 163). This strategy of proof extends to all branches 
log a (-) introduced in Remark 2.13| . More generally, we have the following result, 



dix K (<p(F(s)))/ds = ix K {v>' (F(s))F'(s)), (3.15) 

with F(s) € B\(1C) defined on an interval s% < s < S2, continuously differentiable 
in 23i(/C)-norm, and ip(z) any scalar function, holomorphic in some domain PcC 
with a Jordan boundary and spec(F(s)) C T> for all s G [si, S2]. 



Lemma 3.4. Assume Hypothesis 3.1 and C\R. Then 

tr H ((H - z)- 1 - (H+ - z)- 1 ) = dtr H+ {\og($ + (z)))/dz, (3.16a) 
tT H ((H + - z)- 1 — (H — z)- 1 ) = dtr w _(log($_(z)))/dz. (3.16b) 



Proof. It suffices to prove ( |3.16a ). Applying ( |3 . 13| ) repeatedly one infers 



tv H ((H - z)- 1 - (H+ - z)- 1 ) 

= tr H ((flo - z^V+^Ho - z)- 1 - (H+ - z^V+iHo - z)- 1 }) 

= tv H (J + K*(Ho - z)- 2 KJ+[J+ - J+K*(H+ - z)- 1 KJ+\) 

= tr w (J+<D' + (z)4$ + ( z )- 1 J + ) = tT H+ {& + {z)®+( z )~ X ) 

= dtr H+ (log(3>+(z)))/dz, z G C\M, (3.17) 

where we used J\ = J+, 

tv H (AB) = tv H {BA) (3.18) 
for A,B £ B(H) with AB, BA G Bi(H) (cf. Corollary 3.8 in @), and (jjuj). □ 
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Next, applying Theorem 2.10 and Remark 2.11 to Q+(z) and $-(2) one 
infers the existence of two families of bounded operators {3±(A)}agr defined for 
(Lebesgue) a.e. Ael and satisfying 



and 



< S±(A) < I±, H±(A) G Bx(H±) for a.e. Ael, 
||H ± (-)|| 1 €£ 1 (R;(l + A 2 )- 1 rfA) 

log($ + (z)) = log(/+ + J+K*(H - z)- 1 ^) 

= C++ [ dAS+(A)((A-z)- 1 -A(l + A 2 )- 1 ), 

log($_(z)) = log(I_ - J_K*(H + - z)- l K\ H _) 

= C_- / dAS_(A)((A-z)- 1 -A(l + A 2 )- 1 ) 



(3.19) 



(3.20a) 



(3.20b) 

for z e C\M, with C± = C* ± E Bx{H). 

Equations (3.2C) motivate the following 

Definition 3.5. S + (A) (resp., S_(A)) is called the spectral shift operator associ- 
ated with $+(z) (resp., $_ (z)). Alternatively, we will refer to 3 + (A) as the spectral 
shift operator associated with the pair (H , H + ) and occasionally use the notation 
S + (A, H ,H + ) to stress the dependence on (H ,H + ), etc. 

Moreover, we introduce 

£±(A) = tr w± (3±(A)), < £± G L 1 (K; (1 + A 2 ) _1 dA) for a.e. Ael. (3.21) 

Actually, taking into account the simple behavior of <$>+(iy) and <fr-(iy) as 



00, one can improve (|3.20aD and ( |3.20b| ) as follows. 



Lemma 3.6. Assume Hypothesis 3.1 and define £± as in (3.21). Then 

< £± G L\R;d\), 
and (3.20a) and ( |3.20b ) simplify to 

log(*+(z)) = ^dA2+(A)(A- z)-\ 



log($_(z)) = - / d\^{\){\-z)-\ 



Moreover, for a.e. A G 



lim||3+(A) -7r 1 Im(log($ + (A + ie 

eJ.0 



Bi(H+) 



= 0, 



hm ||3_(A) + ^- 1 Im(log(0>_(A + ie)))|| Bl(w _) = 0. 



(3.22) 

(3.23a) 
(3.23b) 

(3.24a) 
(3.24b) 
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Proof. It suffices to consider £+(A) and <f> + (z). Since 

||log($ + (y))|| 1 =0(|y|- 1 ) as M-oo (3.25) 

by the Hilbert- Schmidt hypothesis on K and the fact \\(H — «y) _1 || = 0(|y| _1 ) 
as 1 2/ 1 — * oo, the scalar Herglotz function tr^ + (log(<i> + (z))) satisfies 

|tr„ + (log($+(z)))| = 0{\y\~ 1 ) as |y| - oo. (3.26) 

By standard results (see, e.g., @), ( ^.26| ) yields 

tr w+ (log($+(z))) = J du> + {X){X-z)~\ zgC\R, (3.27) 

where w+ is a finite measure, 

dw+(A) - -t Hm(j/tr„ + (log($ + fz)))) < oo. (3.28) 

yjoo 

Moreover, the fact that Im(log($+(z))) is uniformly bounded with respect to z G 
C+ yields that lu+ is purely absolutely continuous, 

duj+(X) = £+(A)dA, e+ 6 L\R;dX), (3.29) 

where 

£+(A) = 7 r- 1 lim(Im(tr w+ (log(<i> + (A + l e))))) =tr w+ (H+(A)) 

elO 

= tt" 1 lim(Im(log(det w , ($+(A + is))))) for a.e. A G R. (3.30) 

ej.0 

In order to prove ( |3.24a ) we first observe that Im(log($_|_(A+ie))) takes on bound- 
ary values Im(log($+(A + iO))) for a.e. A G R in i3i(H+)-norm by (|J). Next, 
choosing an orthonormal system {e n } ng N C we recall that the quadratic 

form (e„,Im(log($(A + iO)))e„)-^ + exists for all A G R\£„, where £ n has Lebesgue 
measure zero. Thus one observes, 

lim(e m ,Im(log($ + (A + ie)))e n ) w = (e m ,Im(log($+(A + iO)))e n ) H+ 
= 7r(e m ,E+(X)e n ) n+ for A e R\{£ m U £ n }. (3.31) 
Let £ = L) n& ^£ n , then \£\ — (| • | denoting the Lebesgue measure on R) and hence 
(/ ! Im(log($ + (A + iO))) 5 ) w+ =7r(/,S+(A) 5 ) w+ (3.32) 
for A G R\£ and f,g eV = lin.span {e„ G | n G N}. 

Since 2? is dense in H+ and 3+ (A) G B(TL+) one infers Im(log(<I>+(A + iO))) = 
7tH + (A) for a.e. A G R, completing the proof. □ 

Of course ( 3.24a ), ( 3.24b|) (and the method of proof ) immediately extend to 
a.e. nontangcntial limits to the real axis. 
Assuming Hypothesis |3.l| we define 

f (A) = £+(A) - £_(A) for a.e. A G R (3.33) 
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and call £(A) (resp., £+(A), £_(A)) the spectral shift function associated with the 
pair (H ,H) (resp., (H ,H + ), (H + ,Hj), sometimes also denoted by £(\,H ,H), 
etc., to underscore the dependence on the pair involved. 

M. Krein's basic trace formula [Bl] is now obtained as follows. 



Theorem 3.7. Assume Hypothesis \3. \ . Then (z £ C\{spec(i?o) Uspec(i7)}) 

tr H {(H - z)- 1 - (H - z)- 1 ) = - [ dA£(A)(A - z)- 2 . (3.34) 

JR 

Proof. Let z e C\R. By ( gg^ ) and (|T2|) we infer 

tr w+ (log($+(«))) = / dAC+(A)(A - z)-\ (3.35) 



tr H _(log($_(z))) = - / dAC-(A)(A-z)- 1 . (3.36) 



Adding ( 3.16a| ) and ( 3. 16b| ) , differentiating (3.35) and ( 3.36| ) with respect to 



z 



proves (3.34) for z S C\R. The result extends to all z 6 C\{spec(i/o) U spec(i?)} 



by continuity of ((H — z) 1 — (Hq — z) v ) in £>i (H)-norm. □ 



In particular, £(A) introduced in (3.33) is Krein's original spectral shift func- 



tion (up to normalization) . As noted in Section |2|, the spectral shift operator S + (A) 
in the particular case V = V+, and its relation to Krein's spectral shift function 
£+(A), was first studied by Carey Q in 1976. 



Remark 3.8. (i) As shown originally by M. Krein pi] , the trace formula ( [3.34 ) 
extends to 

tr(/(fl) - f(H )) = [ d\£{\)f'{\) (3.37) 



for appropriate functions /. This fact has been studied by numerous authors and 
we refer, for instance, to §, Ch. 19, @-||, H, 11, [H, @, @, @, 
Ch. 8 and the references therein. 

(ii) While we focus here on pairs of self-adjoint operators (Ho, H), M. Krein in his 
original paper |5l| also considered pairs of unitary operators. Given the conformal 
equivalence of C+ and the open interior of the unit disk D in C, this corresponds 
precisely to the study of Nevanlinna, respectively, Riesz-Herglotz functions in C+, 
respectively, D. Pairs of unitary operators are also studied, for instance, in jyj, 
Jr2[ , p8[ , Q, pa|. The trace formula in the case of non-self-adjoint and non- 
unitary pairs of operators is a rather challenging problem. The interested reader 
can get some insight into this matter by consulting Q , [|3| , |55| , Q and the ex- 
tensive literature cited therein. Similarly, the case of non-trace class perturbations 
and associated trace formulas has been studied by various authors. We refer, for 
instance, to J75[ , where Hilbert-Schmidt perturbations of unitary operators are 
treated in depth. 

(iii) While scattering theory for the pair (Hq, H) is not discussed in this paper, we 
remark that £(A), for a.e. A £ spec ac (i?o) (the absolutely continuous spectrum of 
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Ho), is related to the scattering operator at fixed energy A by the Birman-Krein 
formula ||, 

dct Hx (S(X, H , H)) = e- 27rl ^ X) for a.e. A G spec ac (# ). (3.38) 

Here S(X, Hq, H) denote the fibers in the direct integral representation of the 
scattering operator 

S(H ,H)= ^ dXS{X,H ,H) in H = f & dXH x 

with respect to the absolutely continuous part H Qt ac of H . This fundamental 
connection, originally due to Birman and Krein 0, is further discussed in 0, 
Ch. 19, ||, @, @, ||, [g§, |§, Ch. 8 and the literature cited therein, 
(iv) The standard identity (glj, Sect. IV. 3) 

tr H ((H - z)- 1 - (H a - z)- 1 ) = -dlog(det H (I H + V(H - *) _1 ))/<fe (3.39) 



together with the trace formula ( 3.34 ) yields the well-known connection between 



perturbation determinants and £(A), also due to M. Krein |5l|] 

log(det„(/„ + V(H - z)- 1 )) = f dA£(A)(A - z)' 1 , (3.40) 



£(A) = lim7r~ 1 Im(log(det H (/ w + V(H - (A + iO))" 1 ))) for a.e. A G K, (3.41) 



tr H (V) = / d\£(\), / dA|e(A)| < ||V||i. (3.42) 
This is discussed in more detail, for instance, in §, Ch. 19, 111, @, M, |1, 



§J, §§-|J, (88). Relation ( pd| ) and the analog of for d{AE H (X)B)/d\, 

where ff = iT + V, V = B*A, A, B e B 2 {H), V = V* , leads to the expression 

£(A) = (-27TZ)- 1 tr„(log(7„ - 2tu(J w — A(H — X — tO)- 1 B*)(d(AE H (X)B*)/dX))) 

for a.e. Aet (cf., e.g., §, Sects. 3.4.4 and 19.1.4). 

(v) The invariance principle for wave operators of the pair (-ffo, H) implies 

£(A) =Z(\Ho,H)=sgn(*'(X))Z(*(X),*(H ),*(H)) (3.43) 

for a certain class of admissible functions 'J. In certain cases (e.g., if H and if 
are bounded from below) this can be used to define £ even though H — H = V is 
not of trace class as long as (^(H) — ^>(Ho)) G Bi(H). Prime candidates for W in 
such cases are semigroup (^E'(A) = e~ tx ,t > 0) and resolvent (\&(A) = (A — z)^ 1 , 
z G C\R or z < Eo for some Eo G R) functions. Pertinent facts in this connection 
can be found in @, @, and |§, Sect. 8.11. 

(vi) For simplicity we chose a single Hilbert space formulation throughout this sec- 
tion. However, every result immediately extends to the case where K G £>2(/C,7i), 
J = J* G B(K), J 2 = Ik. and /C is another complex separable Hilbert space. 
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Remark 3.9. Suppose Hq,H\ 1 and H 2 are self-adjoint operators in H with {Hj — 
H k ) G Bi(H) for all j,k G {0,1,2}. Denoting by i{X,Hj,H k ) the Krein spectral 
shift function of the pair {Hj, H k ) such that 

tx n ((H k - z)- 1 - {Hj - z)- 1 ) = - f d\£{\Hj,H k ){\ - z)" 2 , (3.44) 

the chain rule, 

Z{\,H ,H 2 )=£{\,H ,H 1 ) + t{\,H 1 ,H 2 ) a.e., (3.45) 

together with 

£,{\,H 3 ,H k ) = -£(\,H k ,Hj) a.e., (3.46) 

and 

£{\,Hj,H k ) > a.e. if {H k - Hj) > 0, (3.47) 

imply the monotonicity property 

Z{X,H ,H 2 ) > £(A, H Ql Hi) a.e. if H 2 > H x . (3.48) 

Here ( |3.4 7 D is clear from ( |3.19|) ( |3 . 2 1| ) . Equation (|^) follows from ( |3.44| ), and 
(3.45) is a consequence of (3.41) observing the facts 

In + V{H - z)- 1 ={H + V- z){H - z)~\ V G B{H), 

det w ((I w + A)(J W + £)) = det w (I w + A)det w (/ W + B), A,B G B X (H). 



Given the monotonicity property ( 3.48 ) of Krein's spectral shift function, it 
is natural to inquire whether or not this property is shared by the spectral shift 
operator. More precisely, one might ask whether or not 

E{X,H ,H 2 ) > E{X,H ,H 1 ) a.e. if H 2 >H t > H . (3.49) 
The following simple counter example destroys such hopes. 

Example 3.10. Let H = 0, K G B 2 {H), J = In, and hence ${z) = In ~ 
K*Kz- x , z G C\{0}. Then 

hm || log(7„ - K*K{X + is)- 1 ) - log{I n - K* K\~ 1 )\\ Bl i yH ) = (3.50) 

for A e R\{spec(K*K) U {0}} and 

7r _1 Im(log(/ w — K*K\~ 1 )) — S(A), A G M\{spec(K*K) U {0}}. (3.51) 

Decomposing the self-adjoint operator In — K*KX~ l , A G R.\{spec(-ftT*-K') U {0}} 
into its positive and negative spectral parts then yields S(A) = 9{K*K — XIn), 
X G K\{spec(i ; £T*-fr)U{0}}, where 9{-) denotes the usual step function (i.e., 9{x) = 1 
for x > and 9{x) = for x < 0). 

Next we choose H = C 2 , < a < b < c < 1, ac - b 2 > 0, and 

K*K 1 =(] f\, spec(^x) = {1 - b, 1 + b}, (3.52) 
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with eigenvectors 2 _1 / 2 (1, ±1)' associated to the eigenvalues 1 ± b and 

K*K 2 =(^ + a spec(# 2 *# 2 ) = {l + a,l + c}, (3.53) 

with eigenvectors (1,0)* (resp., (0,1)*) associated to the eigenvalue 1 + a (resp., 
1 + c). Finally, choosing A G (1 + a, 1 + b) then yields 

Si (A) = B{K\K X - XI C 2) = E k;Ki {{1 + 6}), (3.54) 
3 2 (A) = 6{K*K 2 - XI C 2) = E KSK2 ({1 + c}), (3.55) 

where \Ek*k (A)}asr denotes the family of orthogonal spectral projections of 
K*K. Clearly H 2 = K*K 2 > K\K X = Hi > H = but S 2 (A) ^ Si (A) for 
A G (1 + a, 1 + b) since one-dimensional self-adjoint projections cannot satisfy 
an order relation unless one is a real multiple of the other. (Note, however, that 
tr(S 2 (A))= l + c>l + 6 = tr(S x (A)) in accordance with fl3.48|).) 



This example shows, in particular, that the chain rule ( 3.45 ) for £(A) does 
not extend to S(A). 

4. Spectral Averaging: An Operator-Theoretic Approach 

In this section we apply the formalism developed in Sections || and || to 
provide an effortless proof of spectral averaging and its relation to Krein's spectral 
shift function as originally proven by Birman and Solomyak jllj . 

For the basic setup of this section we assume the following hypothesis. 

Hypothesis 4.1. Let Hq be a self-adjoint operator in TL with dom(iJo), and as- 
sume {V(s)} S £n C B\(TL) to be a family of self- adjoint trace class operators in TL, 
where f2 C R denotes an open interval. Moreover, suppose that V(s) is continu- 
ously differ entiable with respect to s G fl in trace norm. 

To begin our discussions we temporarily assume that V(s) > 0, that is, we 
suppose 

V(s) = K(s)K(s)*, sen (4.1) 



for some K(s) G £> 2 (7i), s G O. Given Hypothesis 4.1 we define the self-adjoint 
operator H (s) in TL by 

H{s) = H + V(s), dam(H(s)) = dom(#o), s G Q. (4.2) 



In analogy to (3J5) and (3J3) we introduce in TL (s G 17, z G C\R), 

$(z, s)=I H + K(s)*(H - z)- l K(s) (4.3) 

and hence infer from Lemma |3.2| that 

$(z, s)' 1 =I H - K(s)*{H{s) - z^Kis). (4.4) 

The following is an elementary but useful result needed in the context of Theorem 
4.3. 
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Lemma 4.2. Assume Hypothesis \ and (4.1). Then (s £ CI, z G 

dtr„(log($(z, s)))/ds = tv H (V'(s)(H(s) - z)- 1 ). (4.5) 
Proof. By fl3.15| ), ( |4.3| ), and ( |4.4| ) one infers for z = iy, |y| > sufficiently large, 
dtr w (log($(z, s)))/ds = dtr w (log(7 w - ^(«)(ff - z) _1 ))/ds 

OO 

= (d/ds)j2(-iy(3+ir l tM(v(s)(H -z)- i y +i ) 

3=0 

OO 

= tr H (V'(s)((H - z)^ '(^X^o - 

= tr w (V / («)((H( 8 )-«)- 1 ) (4.6) 

by repeated use of ( 3.18Q and (3.13a). Analytic continuation of (4.6) with respect 
to z e C\M then proves (TO). □ 



Next, applying Lemma 3.6 to <&{z, s) in (4.3) one infers (s £ fi), 



k>g(*(z,a))= / dA3(A,s)(A-z)-\ 

< H(A, s) < J w , S(A, s) e for a.e. A e E, 

||S(.,a)||i el^l^A), 



(4.7) 
(4.8) 



where 5(A, s) is associated with the pair (H , H(s)), assuming H(s) > H , s e fi. 

Our principle result on averaging the spectral measure of {Eu( s ) (A)}agR of 
H(s) then reads as follows. 



Theorem 4.3. Assume Hypothesis 4-1 and [si,S2] C Cl. Lei £(A, s) be the spectral 
shift function associated with the pair (Hq, H(s)) (cf. ( |3.33 )) 7 where H(s) is defined 
by (4.2) (and we no longer suppose H(s) > Hq). Then 

' ds (d(tr n (V'(s)E H(s) (A)))) = (£(A, s 2 ) - £(A, *i))dA. (4.9) 

Proof. Fir st w e prove J4.9| ) in the case V(s) > 0. The monotone convergence 
theorem, (4.7), and Lemma p~2| then yield (z E C\R), 

tr„( / dA((A-Re(z)) 2 + (Im(z)) 2 )- 1 Im(z)(S(A, S2 )-S(A, Sl )) 

dX ((A - Rc(z)) 2 + (Im(z)) 2 )- 1 Im(z)(e(A, s 2 ) - £(A, s x )) 
tr w (Im(log($(z, s 2 )))) - tr w (Im(log($(z, si)))) 

jTda ^tr w (Im(log($(z, S ))))^ = dstvu(V'(s)Im((H(s) - z)- 1 )). 
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By the spectral theorem applied to H(s) one obtains 

lm((H(s) - z)" 1 ) = Im(z) J dE H[s) (A)((A - Re(z)) 2 + (Im(z)) 2 )- 1 (4.10) 
and hence 

dX ((A - Re(z)) 2 + (Im(z)) 2 )- 1 (e(A, s 2 ) - £(A, Sl )) 



ds tl-H 



dE H{s) (X))((X - Re(z)) 2 + (Im(z)) 2 )-V( S ) 



(4.11) 



Decomposing the self-adjoint trace class operator V'(s) into its positive and neg- 
ative parts, 

V'(s) = (V'(s)) + (V'(a)U < (V'(s)) ± G B 1 (H), (4.12) 
the monotone convergence theorem yields 

dX ((A - Re(z)) 2 + (Im(z)) 2 )~ 1 (e(A, s 2 ) - £(A, Sl )) 



ds jf R ((A - Re(z)) 2 + (Im(z)) 2 )- 1 (d(tr w ((T/'(.s))V 2 ^ (s) (A)(l/'( S ))f ) 
d(tr w ((T/'( S )) V2 ^(,)(A)(F'( S ))i /2 ))) 



ds / ((A - Re(z)) 2 + (Im(z)) 2 )- 1 ci(tr w (F'( S )£; ff(s) (A))) 



((A-Rc(z)) 2 + (Im(z)) 2 y 



ds(d(tv H (V'(s)E H(s) (X)))) 



(4.13) 



using Fubini's theorem in the last step. Thus, by the uniqueness property of Poisson 
transforms, 



(£(A,* a )-£(A,*i))dA 



ds(d(tr n (V'(s)E H{s) (X)))). (4.14) 



In the case of arbitrary V(s) G Bi(7i) (not necessarily nonnegative), we argue 
as follows. Define (cf. ( 4.12 )) 

fS2 



w = 



ds(V'(s))- )-V( Sl ), 



(4.15) 



then W G Bi(H) and VW + W > for all s G [s 1 , s 2 ]. Equation (|h9|) now follows 
from the chain rule (3.45) for spectral shift functions 

t(\,H ,H(8))=Z(\ i H 0) Ho - W)+Z(X,H - W,H(s)). (4.16) 

Indeed, (V(s) + W)' — V'(s), and £(A, H ,H — W) is independent of s and hence 
drops out on the right-hand side of (4.9). Moreover, the pair (Hq — W, H(s)) only 
involves the nonnegative perturbation V(s) + W of Ho — W in H (s) = Hq — W + 
(V(s) + W) so that Lemma 4.2 becomes applicable as in (4.14) in the first part of 
our proof. □ 
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Remark 4.4. (i) In the special case of averaging over the boundary condition pa- 
rameter for half-line Sturm-Liouville operators (effectively a rank-one resolvent 



perturbation problem), Theorem 4.2 has first been derived by Javrjan |}42[ . [43|. 
The case of rank-one perturbations was recently treated in detail by Simon [79|. 
The general case of trace class perturbations is due to Birman and Solomyak 
using an approach of Stieltjes' double operator integrals. Birman and Solomyak 
treat the case V(s) = sV, V £ B\{K), s e [0, 1]. As explained in g, Q, and @, 
the authors were interested in a real analysis approach to the spectral shift func- 
tion in contrast to M. Krein's complex analytic treatment. In this way the local 
integrability of £ could not be obtained directly (only its property of being a gen- 
eralized function was obtained) although it follows of course from the uniqueness 
£ (up to additive constants). While our operator theoretic approach is intrinsically 
complex-analytic, and hence very much in M. Krein's spirit, it leads to a natural 
pro of o f the absolute conti nui ty of the (signed) measure on the right-hand side 



of (4.S). A short proof of (4.9) (assuming V'(s) > 0) has recently been given by 



Simon |8j. 

(ii) We note that variants of (4J3) in the context of one-dimensional Sturm- 
Liouville operators (i.e., variants of Javrjan's results in ji2j, Q) have been re- 
peatedly rediscovered by several authors. In particular, the absolute continuity 
of averaged spectral measures (with respect to boundary condition parameters or 
coupling constants of rank-one perturbations) has been used to prove localization 
properties of one-dimensional random Schrodinger operators (see, e.g., Jl7| , po| , 
Q, @, Ch. VIII, [pi, ||, g7Hp , Q, Ch. V, f§, l79|). 

(iii) We emphasize that Theorem [f.Sj applies to unbounded operators (and hence to 
random Schrodinger operators bounded from below) as long as appropriate relative 
trace class conditions (either with respect to resolvent or semigroup perturbations) 
are satisfied. 

(iv) In the special case V (s) > 0, the measure 

d(t rn (V'(s)E H(s) (X))) = d(trn(V' {s) 1 / 2 E H(s )(X)V' (s) 1 ^ 2 )) 



in (|4.9|) represents a positive measure. 



(v) The result (4.9) is not restricted to a one-dimensional parameter space s £ 
[s±, S2]. In fact, if 7(si, S2) denotes an oriented piecewise C 1 -path connecting Si 6 
R" and S2 £ K™, one obtains analogously, 

dB.(d(tr w ((VV)(8)B jrw (A)))) = K(A,8 a )-C(A,si))dA. (4.17) 

'7(si,s 2 ) 

We omit further details. 

In the special case of a sign-definite perturbation of Ho of the form sKK*, 
one can in fact prove an operator- valued averaging formula as follows. 

Theorem 4.5. Assume Hypothesis \3. i| and J = Iu- Then 

[ dsd(K*E Ho+sKK ,(X)K) =H(A)dA, (4.18) 
Jo 
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where S(-) is the spectral shift operator associated with 

<S>{z) = I H + K*{Hv-z)- 1 K, zeC\M, (4.19) 

that is, 

log($(2)) = J d\E(X)(X- z)-\ zeC\l, (4.20) 

< 3(A) G Bi(H) for a.e. A G R, ||S(-)||i € L 1 ^; dX). (4.21) 
Proof. An explicit computation shows 
(A-$(z))- 1 = -(l-A)- 1 

x {I H - (1 - X^K^Hq + (1 - X)~ l KK* - z)~ l K) e B{H) (4.22) 

for all A < 0. Since log($(z)) = ln(<f>(z)) for z G C+ as a result of analytic 
continuation, one obtains 



log(*(*)) = / dXE(X)(X - z)- 1 
= ln($(z)) = / dX ((A - ^(z))- 1 - A(l + X 2 )- 1 ^) 

DO 
I 

dX (1 - X)- 2 K*(H + (1 - X^KK* - z)~ l K 

ds f d{K*E Ho+sKK ,{X)K){X- z)- 1 

= f (X-z)- 1 f dsd(K*E Ho+sKK ,(X)K) (4.23) 
Jm Jo 

proving ( |4.18| ). (Here the interchange of the A and s integrals follows from Fubini's 
theorem considering (4.23) in the weak sense.) □ 



As a consequence of Theorem |4.5| one obtains 



f ' dsd{K*E Ho+sKK * (X)K) = 3(A, fl2 ) - S(A, s x ), 

J Si 



(4.24) 



where S(A, s) is the spectral shift operator associated with <f>(z, s) = I?i+sK*(H — 
z)- x K, s G [si,s 2 ]. 
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